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CONVERGENCE  RATE  OF  A  GRAM-SCHMIDT  CANCELLER 
I.  INTRODUCTION 

The  open-loop  Gram-Schmidt  (GS)  technique  for  adaptive  cancellation  [1-6]  has  been  shown  to 
yield  superior  performance  simultaneously  in  arithmetic  efficiency,  stability,  and  convergence  times 
over  other  adaptive  algorithms.  Arithmetic  efficiency  results  from  using  systolic  processing  architec¬ 
tures  that  take  advantage  of  the  GS  structure.  In  addition,  the  stability  of  the  GS  algorithm  is 
enhanced  because  it  does  not  require  the  direct  calculation  of  an  inverse  covariance  matrix  as  does  the 
Sampled  Matrix  Inversion  (SMI)  algorithm  [7].  Also,  the  GS  canceller  algorithm  is  very  suitable  for 
a  nonstationary  noise  environment  because  the  adaptive  weights  can  be  updated  in  a  numerically  effi¬ 
cient  manner,  by  using  “sliding  window”  techniques  on  the  input  data  instead  of  “batch”  or 
“block”  processing.  Two  types  of  batch  processing  techniques  are  concurrent  processing  and  non¬ 
concurrent  processing.  For  concurrent  processing,  the  adaptive  weights  are  calculated  from  an  input 
data  set  and  reapplied  to  the  same  input  data  set.  For  nonconcurrent  processing,  the  weights  are 
applied  to  a  different  data  set. 

The  optimal  weights  associated  with  an  adaptive  canceller  are  generally  not  known  a  priori  and 
thus  must  be  estimated  by  using  finite  averaging.  Because  of  the  use  of  estimated  weights,  subop- 
timal  canceller  performance  results.  Reed,  Mallet,  and  Brennan  [7,8]  quantified  this  performance  for 
the  SMI  algorithm  in  the  transient  state  under  certain  input  conditions,  one  of  these  being  that  the 
input  noise  must  be  Gaussian.  They  mathematically  demonstrated  that  the  SMI  canceller  has  rela¬ 
tively  fast  convergence  characteristics  and  also  that  the  convergence  is  independent  of  the  input 
covariance  matrix. 

In  this  report,  we  show  that  the  GS  canceller  algorithm  is  numerically  identical  with  the  SMI 
algorithm  in  the  transient  state  if  infinite  numerical  accuracy  is  assumed  (Section  IV).  By  transient 
state,  we  mean  that  a  finite  number  of  time-coincident  samples  per  channel  are  used  to  obtain  an  esti¬ 
mate  of  the  optimal  weights.  Thus  the  convergence  rate  or  any  other  measures  of  effectiveness  of  the 
two  algorithms  in  the  transient  state  are  identical.  In  addition,  we  reproduce  many  of  the  results  of 
Refs.  7  and  8  by  using  the  GS  canceller  structures  as  an  analysis  tool  (Sections  V  to  VIII).  Also, 
new  results  are  generated  for  the  case  when  the  input  noises  are  not  necessarily  Gaussian  (Sections  IX 
and  X).  Furthermore,  various  results  are  presented  for  the  three  forms  of  the  GS  canceller  (con¬ 
current  processor,  nonconcurrent  processor,  and  sliding  window  processor).  In  particular,  it  is  shown 
that  the  sliding  window  GS  canceller  is  convergent,  equivalent  in  the  transient  state  with  the  con¬ 
current  GS  canceller  (Section  XI).  Section  XII  discusses  the  deleterious  effect  of  “overmatching  the 
degrees  of  freedom.” 

Note  that  the  analysis  presented  in  this  report  pertains  to  the  adaptive  processor  in  the  “sidelobe 
canceller”  (SLC)  configuration,  where  the  desired  signal  is  assumed  to  be  present  only  in  the  main 
channel  and  auxiliary  channels  are  used  to  cancel  correlated  noises  in  the  main  channel.  However,  as 
is  shown  in  Ref.  7,  any  nonconstrained,  linear  adaptive  array  processor  can  be  transformed  into  an 
SLC  configuration  without  changing  the  convergence  properties.  Hence,  the  results  of  this  report 
apply  to  any  nonconstrained  linear  adaptive  array  processor. 

Manuscript  approved  March  9,  1989. 
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H.  THE  GS  CANCELLER 

Consider  the  general  jV -input  GS  canceller  structure  as  seen  in  Fig.  1(a).  Let  xM(t), 
x )(r)  ,  xN_\(t)  represent  the  complex  data  in  the  Oth,  1st,  ....  N  -  1th  channels,  respectively. 

We  call  the  leftmost  input  xM(t)  the  main  channel,  and  we  call  the  remaining  N  —  1  inputs  the  auxili¬ 
ary  channels.  The  main  channel’s  signal  consists  of  a  desired  signal  plus  additive  noise.  The  noise 
consists  of  internal  noise  plus  external  noise.  Cancellation  of  the  signals  from  external  interfering 
sources  relies  on  the  correlation  of  simultaneously  received  signals  in  the  main  and  auxiliary  channels. 
The  internal  noises  on  each  channel  are  assumed  uncorrelated  between  channels.  The  canceller 
operates  so  as  to  decorrelate  the  auxiliary  inputs  one  at  a  time  from  the  other  inputs  by  use  of  the 
basic  two-input  GS  processor  as  is  shown  in  Fig.  1(b).  For  example.  Fig.  1(a)  shows  that  jtV-|(/ )  is 
decorrelated  with  xM(t),  X[(f),  •  ••  ,  xN_2({)  *n  ^,rst  level  °f  decomposition.  Next,  the  output 
channel  that  results  from  decorrelating  xN_  [(/)  with  xN  _2(f )  is  decorrelated  with  the  other  outputs  of 
the  first-level  GSs.  The  decomposition  proceeds  until  a  final  output  channel  is  generated.  If  the 
decorrelation  weights  in  each  of  the  two-input  GSs  are  computed  from  an  infinite  number  of  input 
samples,  this  output  channel  is  totally  decorrelated  with  the  input:  Xj(r),  x2(t),  . . .  ,  xN_x(t). 


MAIN 

CHANNEL  AUXILIARY  CHANNELS 


Fig  1(a)  —  GS  structure 
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Fig.  1(b)  —  Basic  two-input 
GS  canceller 


If  there  is  not  an  infinite  number  of  input  samples,  the  decorrelation  weights  associated  with 
each  two-input  GS  canceller  are  estimated  by  using  finite  averaging.  In  this  section  we  discuss  two 
methods  of  processing  data  through  the  GS  canceller.  The  first  is  called  concurrent  processing 
whereby  the  weights  are  estimated  from  a  block  of  input  data  and  applied  back  onto  the  same  input 
data  set.  The  second  method  is  called  nonconcurrent  processing  whereby  the  weights  are  estimated 
from  a  block  of  input  data  and  applied  to  subsequent  or  previous  input  data.  Inherent  in  both  tech¬ 
niques  is  the  “block  processing”  of  data.  (In  Section  XI  we  consider  another  form  of  the  GS  can¬ 
celler;  it  uses  sliding  window  or  systolic  techniques.)  Reference  8  shows  that  the  average  output 
noise  power  residue  can  vary  quite  differently,  depending  on  whether  concurrent  or  nonconcurrent 
processing  is  used. 

We  now  briefly  describe  the  concurrent  and  nonconcurrent  GS  canceller.  For  the  concurrent 
canceller,  let  jc„(m)  represent  the  time-coincident  outputs  of  the  two-input  GSs  on  the  (m  —  l)th  level. 
Then  outputs  of  the  two-input  GSs  at  the  mth  level  are  given  by 


r(m  +  l)  (m)  _ 
n 


»„(m)  xfrjm. 


n  =0, 1 . N  -  m  -  1, 

m  =  1,2,. . . ,  N  -  1. 


(2.1) 


Note  that  xq}  =  xM  and  .x„(1)  =  x n  =  1,  2,  . . .  N  —  1.  The  weight  w„(m),  seen  in  Eq.  (2.1),  is 
computed  so  as  to  decorrelate  x^m)  with  x^mJm .  For  K  input  samples  per  channel,  this  weight  is 
estimated  as 


£  xjr?m  (k)xlm\k) 

k  =1 _ 

£  i xjrim{k)\2 

*= i 


(2.2) 


where  *  denotes  the  complex  conjugate  and  |  •  |  is  the  magnitude.  Here  k  indexes  the  time- 
coincident  sampled  data. 


For  the  nonconcurrent  canceller,  let  X*m>  represent  the  outputs  of  the  two-input  GSs  on  the 
( m  -  l)th  level.  Then  the  outputs  of  the  two-input  GSs  at  the  mth  level  are  given  by 
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v  (m  +  I )  _  y  (m ) 
™ n 


W 


im)  xfTJm , 


n  =0,  1,...  .  N  —  m  -  1 
m  =  1,2 . N  -  1, 


(2.3) 


where  X^11  =  XM ,  X„(,)  =  X„,  n  =  1,2,..  A/  -  1,  and  w„(m)  is  calculated  by  the  use  of  Eq.  (2.2); 

i.e.,  these  weights  are  computed  from  a  block  of  data  that  does  not  include  Xn . 


Let  jc0  and  X0  represent  the  additive  noises  in  the  main  channel  for  concurrent  and  noncon¬ 
current  processing,  respectively.  For  this  development,  unless  otherwise  noted,  we  make  the  follow¬ 
ing  assumptions: 


I.  The  jc0,  jc | ,  ...  ,  %_i  and  X0,  Xt . XN_x  are  identically  distributed  Gaussian  com¬ 

plex  random  variables  (r.v.). 


2.  These  same  r.v.’s  are  samples  from  stationary  processes  with  zero  mean  and  equal  vari¬ 
ance. 


3.  For  k2 ,  xn(k ()  is  independent  of xn{k2)  and  X„(k()  is  independent  of  X„(k2). 

4.  ■*„,(£()  is  independent  of  X„  (k2)  for  all  kh  k2,  nl%  n2. 

5.  The  desired  signals  are  not  present  during  weight  computation  for  nonconcurrent  process¬ 
ing. 

6.  The  desired  signals  are  not  present  in  the  auxiliary  channels. 

Note  that  in  Sections  IX  and  X,  we  remove  the  assumption  that  the  r.v.’s  are  Gaussian. 

The  following  definition  is  used  often  in  the  upcoming  development.  A  normalized  L -length 
multivariate  complex  circular  Gaussian  vector  has  L  elements,  each  of  which  has  real  and  imaginary 
parts  that  are  independent  Gaussian  r.v.’s  with  0  mean  and  variance  equal  to  1/2  (note  the  magnitude 
variance  is  one).  In  addition,  the  L  elements  are  independent  of  one  another. 

III.  OUTPUT  MEASURES 


The  N -input  GS  canceller  structures  for  concurrent  and  nonconcurrent  processing  are  simplified 
by  the  representations  as  seen  in  Fig.  2(a).  There  are  .5 N  (N  -  1)  weights  computed  in  the  GS 
structure.  We  call  these  weights  the  GS  interior  weights.  The  notation  GS*  N  indicates  that  an  N- 
input  GS  structure  uses  K  samples  from  each  channel  to  compute  the  GS  interior  weights  in  the  GS 
structure.  Note  that  for  the  nonconcurrent  structure  the  weights  are  computed  from  the  x0,  Xj,  . . .  , 
xs_x  data  block  and  applied  to  X0,  Xx,  ....  X,v_,.  The  0th  channel  (or  the  far  left  channel  in  Fig. 
2(a))  is  always  designated  as  the  main  channel,  and  the  others  are  called  auxiliary  channels  (or 
AUXs).  The  output  of  the  concurrent  (weighting)  processor  is  denoted  by  zf.H.,  and  the  output  of  the 
nonconcurrent  processor  is  denoted  by  Znw.  We  also  represent  the  GS  structure  as  shown  in  Fig. 
2(b),  where  the  N  orthogonal  outputs  are  displayed. 

As  previously  mentioned,  input  signals  in  the  main  channel  consist  of  a  desired  signal  .v  plus 
noise  x0.  For  nonconcurrent  processing,  it  is  assumed  that  the  desired  signal  passes  from  input  to 
output  unperturbed.  However,  for  concurrent  processing  the  presence  of  the  signal  in  the  main  chan¬ 
nel  causes  signal  cancellation  through  the  GS  canceller  as  reported  in  Ref.  8.  Because  of  linearity, 
the  GS*  /v  canceller  can  be  decomposed  as  shown  in  Fig.  3  (see  Theorem  2,  Section  IV).  The  left- 
hand  GS*  N  canceller  seen  in  this  figure  has  only  the  desired  signal  in  the  main  channel,  and  the 
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! 

I 


Fig.  2(a)  —  Representations  of  concurrent  and  nonconcurrent 
weighting  of  GS  cancellers 


AUX  CHANNELS 


Fig.  2(b)  —  GS  representation  with  N 
output  channels 
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S  *1  XN  -  1  x0  X1  XN  -  1 


Fig.  3  —  Decomposition  of  signal  and  noise  for 
concurrent  processing 


right-hand  GSK  N  has  only  noise  x0  in  the  main  channel.  Note  that  the  interior  weights  of  each 
GS*  N  are  not  identical  because  of  the  different  main  channel  input  in  each  (actually  only  the  weights 
along  the  main  channel  path  of  each  differ).  As  will  be  seen,  we  use  this  decomposition  when  defin¬ 
ing  the  output  noise-to-signal  power  ratio  (NSR). 

It  can  be  shown  that  for  any  set  of  GS  interior  weights  that  are  estimated  and  applied  to  the  aux¬ 
iliary  input  channels  there  is  an  equivalent  linear  weighting  of  the  input  auxiliary  channels.  We 
denote  this  equivalent  linear  weighting  by  the  ( N  -  l)-length  vector  w„.  where 


Thus,  the  outputs  of  the  GS  processed  main  channel  are  identical  to  the  outputs  of  a  main  channel 
derived  by  subtracting  the  linear  weighted  auxiliary  channels  from  the  main  channel  input.  With 
respect  to  the  decomposition  configuration  seen  in  Fig.  3,  we  see  that  wti  is  actually  the  sum  ot  two 
(N  - 1  )-length  weighting  vectors  w„  ,  and  w„„.  where  w(J  v  is  the  auxiliary  linear  weighting  vector 
associated  with  only  the  desired  signal  in  the  main  channel  and  w„  „  is  the  auxiliary  linear  weighting 
vector  associated  with  only  noise  .t0  in  the  main  channel.  Note  that  as  K  —  oo.  w<(  t  —  0:  also  tor 
nonconcurrent  processing,  w(J  =  w„  „.  For  the  GS  canceller  the  weighting  on  the  main  channel  is 
constrained  to  be  one. 

Let  <7^in  be  the  steady  state  (K  —  oo)  output  noise  power  residue  and  let  SNRop,  be  the  steady 
state  output"  signal-to- noise  ratio.  Note  that  the  a^in  is  identical  for  both  concurrent  and  noncon¬ 
current  processing  as  is  SNR,,^.  Here 

Ra  is  the  steady  state  (iV  -  l)  X  (/V  -  1)  input  noise  convariance  matrix  ot  the 
auxiliary  channels. 

Ra  is  the  estimated  auxiliary  input  covariance  matrix  using  i v: . fv-i  dafa 

( K  samples  per  input  channel). 
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is  the  estimated  auxiliary  input  noise  covariance  matrix  using  X,,  X2,  ...  ,  Xw  _ , 
(note,  no  desired  signal  assumed  in  this  calculation), 

is  the  transient  output  noise  power  associated  with  nonconcurrent  weighting 
normalized  by  dividing  by  ff^n, 

is  the  transient  output  noise  power  associated  with  nonconcurrent  weighting 
normalized  to  and  averaged  over  X0,  X, . Xn-i* 


SNR„m.  is  the  transient  output  ratio  SNR  associated  with  nonconcurrent  weighting 
normalized  by  dividing  by  SNRop,  and  averaged  over 
X0,  X,,  ...  ,  X^ _ i , 

|  s '  | 2  is  the  transient  output  signal  power  associated  with  concurrent  weighting 
normalized  by  dividing  by  the  input  desired  signal  power  |  s  | 2, 


is  the  transient  output  noise  power  associated  with  concurrent  weighting 
normalized  by  dividing  by  and 

NSR(  M  is  the  transient  output  noise-to-signai  power  ratio  associated  with  concurrent 
weighting  normalized  by  dividing  by  SNRop,. 

Note  that  the  last  eight  quantities  defined  are  r.v.’s.  NSRfH.  is  defined  as  a  noise-to-signal  ratio 
because  it  is  easier  to  obtain  an  analytical  result  pertaining  to  this  quantity  as  opposed  to  the  SNR. 

By  using  the  above  definitions,  it  can  be  shown  that 

I  2 


-2 


|Z„H- 


=  l*ol 


(3.2a) 


and 


5.2 


=  Ex\  *L\  = 


El  l^o I2!  - 


(3.2b) 


where  E\  )  denotes  the  expected  value  and  Ex\  \  denotes  that  the  expectation  is  taken  over  the  r.v.’s 
X0,x . .  Furthermore, 


and 


SNR  w 


(ir°L) 

SNRop,  ’ 


(3.3) 


1*0  I  2  ~  *o.r,K*a. 


(3.4) 


NSR, 


2 

cw 


-  2 


I* 


2  ' 


(3.5) 
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We  define 


anW  (K  ,N)  =  E\olj  =  E\o2nw\ 

(3.6) 

SNR  nw(K,N)  =  £|SNR„J, 

(3.7) 

o2w  ( K,N )  =  E\a2cJ, 

(3.8) 

scw  ( K,N )  =  £|!*'|2}, 

(3.9) 

NSRch.  (AT, AO  =  E{ NSRfkv). 

(3.10) 

Equations  (3.6)  to  (3.10)  are  the  first  moments  or  average  transient  values  of  the  previously  defined 
output  measures  of  the  GS  cancellers.  These  output  measures  are  commonly  used  to  grade  the  con¬ 
vergence  performance  of  the  SMI  canceller. 

In  addition,  the  /  th  moment  of  SNR„rt  is  defined  as 

SNR<!>  (K,N)  =  £|(SNRmv)' ) ,  (3.11) 

and  the  /  th  moment  of  |  s  '  |  2  is  defined  as 

s£(K,N)  =  £{(M2)'),  (3.12) 


where  scw(K ,  /V)  =  s$(K,  N). 

In  the  succeeding  sections,  expressions  for  o2w{K,N),  SNR nw(K,N),  cj2w(K,N),  scw(K,N), 
NSR<lv(/C, N),  SNR^J  (K ,N),  and  (K,N)  are  derived  by  using  the  GS  canceller  as  an  analysis 
tool. 

IV.  SMI  AND  GS  CANCELLER  EQUIVALENCE 

In  this  section  the  SMI  canceller  and  the  GS  canceller  are  shown  to  be  equivalent  in  the  sense 
that  the  SMI's  estimated  linear  weighting  vector  is  identical  to  the  GS's  equivalent  estimated  vector. 
Hence,  if  either  concurrent  or  nonconcurrent  processing  is  used,  the  output  of  the  SMI  and  GS  can¬ 
celler  is  identical  in  the  transient  state  (finite  averaging).  For  this  equivalence  to  be  true,  infinite 
computational  accuracy  and  the  nonsingularity  of  the  estimated  input  covariance  is  assumed. 

We  briefly  describe  the  SMI  algorithm  for  the  SLC  configuration.  If  Ra  is  the 
(/V  -  I)  x  (/V  -  1)  auxiliary  input  covariance  matrix  and  ram  is  the  N  -  1  length  cross-correlation 
vector  of  auxiliary  channel  against  the  main  channel,  the  optimal  (minimum  output  variance)  N  —  1 
length  weighting  vector  wa  is  given  by 


=  R„ 


(4.1) 
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For  the  SMI  algorithm,  Ra  and  ram  are  estimated,  the  auxiliary  weighting  vector  is  calculated  by 
using  Eq.  (4.1),  this  vector  is  applied  to  the  auxiliary  channels,  and  the  resultant  is  subtracted  from 
the  main  channel. 

Define  the  input  data  vector 

xa(&)  =  x2(k),  ■ . .  ,  *w-i(fc)]r  ,  k  =  1,  2,  . . .  ,  K  (4.2) 

where  a  refers  to  the  auxiliary  channels  and  T  denotes  the  vector  (or  matrix)  transpose  operation. 
The  estimates  of  Ra  and  ram ,  denoted  by  Ra  and  ram ,  are  given  by  the  expressions 

Ra  =  T  E  *!(*)  (4-3) 

A  *=1 

and 


=  7  E  x*(k)x0(k),  (4.4) 

A  *  =  l 

where  the  estimated  linear  weighting  vector  can  be  found  by  using  the  equation 

Ra*  =  •  (4-5> 


We  define  the  following  -length  input  data  vectors 

x„  =  [x„(l),  JCn(2),  . . .  ,  4T„(^)]r  ,  n  =  0,  1,  . . .  ,  N  -  1 

and  a  K  x  {N  -  1)  auxiliary  input  data  matrix  A ,  where 


(4.6) 


A  —  (Xj,  x2,  . . .  ,  xN_,). 


(4.7) 


It  is  straightforward  to  show  that 


and 


Ra  =  A' A 


(4.8) 


ram  =  A'x o,  (4.9) 

where  t  denotes  the  conjugate  transpose.  Thus,  by  using  Eq.  (4.5), 

w  =  ^a'lfam  =  (A'Ay'A'xQ.  (4.10) 

Note  that  a  necessary  condition  that  Ra  be  nonsingular  (and  hence  a  unique  solution  for  w  exists)  is 
that  K  >  N  -  1.  To  show  this,  assume  that  K  <  N  -  1  and  define  an  (N  -  1)  x  (N  -  1)  aug¬ 
mented  matrix  Aaug  as 
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(4.11) 


where  the  last  N  —  K  —  1  rows  are  zero  filled.  Now 

^ a  ~  ^aug^aug  •  (4.12) 

However  the  determinant  of  Ra,  denoted  by  det(fla),  equals  det(Aaug)  ■  det(A’aug).  Since 
det(/4au^)  =  0,  it  follows  that  det(i?a)  =  0  so  that  Ra  is  singular  if  K  <  N  -  1. 

We  now  show  that  if  K  =  N  -  1 ,  that  output  noise  residue  is  zero  for  the  concurrent  processor 

implementation.  Set  w  =  (w,,  w2,  . . .  wA,_1)  and  let  z  be  the  K  length  output  residue  vector  of  the 

concurrent  canceller.  As  a  result 

N- I 

Z  =  X  -  E  wn\n.  (4.13) 

N  =  \ 

By  using  Eq.  (4. 10),  it  can  be  shown  that 

z  =  Xq  -  A^A'Ay'A'xo 

=  (lK  -  A(A‘ArlA')x o,  (4.14) 

where  IK  denotes  the  K  x  K  identity  matrix.  For  K  =  N-l,  A(A'A)~lA'  =  lK,  so  that  z  =  0. 

As  a  result  of  the  preceding  discussion,  in  the  following  development  for  nonconcurrent  process¬ 
ing  we  restrict  K  >  N  —  1 ,  and  for  concurrent  processing  we  restrict  K  >  N  . 

If  we  set  w  =  (w,,  w2,  . . .  ,  wN_,)r,  then  it  can  be  shown  that  Eq.  (4.5)  reduces  to  solving  the 
following  system  of  linear  equations: 

N  - 1 

£  x{xnvv„  =  -  x[xo,  (4.15) 

n  =  1 

N-\ 

£  xjx„  w„  =  -  x^r0, 

n=  1 


N-  1 

£  x^_,xn  W„  =  -  X^_,Xo. 

n  =  1 


to 
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We  show  that  the  solution  for  the  SMI  weights,  w  j ,  h»2,  ...  ,  _ , ,  which  orthogonalizes  the  auxili¬ 

ary  input  data  vectors  xn,  n  =  1,  2,  . . .  ,  N  —  1  to  the  output  residue  vector  z,  is  obtained  by  solv¬ 
ing  the  system  of  equations  given  by  Eq.  (4.15);  i.e.,  the  condition 

x'  z  =  0,  n  =  1,2 . N  -  1  (4.16) 

results  in  a  system  of  equations  identical  to  Eq.  (4.15).  Note  that  Eqs.  (4.13)  and  (4.16)  imply  con¬ 
current  processing.  However  the  weight  calculation  is  valid  for  either  concurrent  or  nonconcurrent 
processing. 

If  the  concurrent  GS  canceller  orthogonalizes  the  output  data  vector  with  respect  to  the  auxiliary 

input  data  vectors  x„,  n  =  1,  2 . N  —  1,  it  follows  that  the  equivalent  linear  weighting  vector 

associated  with  the  GS  structure  is  identical  to  that  computed  for  the  SMI  algorithm.  We  show  that  a 
concurrent  GS  canceller  orthogonalizes  the  output  data  vector  z  with  respect  to  the  auxiliary  input 
data  vector  by  using  mathematical  induction.  This  is  obviously  true  for  N  =  2;  we  assume  that  it  is 
true  for  all  integers  less  than  N  -  1  and  demonstrate  that  it  is  true  for  when  the  number  of  channels 
equals  N . 

From  Fig.  2  and  our  assumptions: 

xi2,,z  =  0,  n  =  1,  2,  ... ,  N  -  2  (4.17) 

and 

x#l',  x<2>  =  0,  «  =0,  1,2,  ...,N  -2,  (4.18) 

where  x^\  x^2)  are  the  AT -length  data  vectors  associated  with  x„(1),  xn(2),  respectively.  Furthermore, 
the  output  vector  can  be  written  as 


z 


42>  -  N£  W'n  X<2>, 


n  =1 


(4.19) 


where  W J,  Wj,  . . . ,  H^jy-2  is  representative  of  the  equivalent  linear  weighting  of  the  input  vectors 
from  level  2  through  N  -  1  of  the  GS  structure  (see  Fig.  1(a)).  Using  Eq.  (2.1)  in  Eq.  (4.17)  results 
in 


v<2)( 

An 


Z 


(x^l)  -  w^xj^L,)'  z  =  x$,l)ti  -  z  =  0,  n  =  I,  2,  . . . ,  N  -  2. 


(4.20) 


From  Eqs.  (4.18)  and  (4.19)  it  can  be  shown  that 

x^'.z  =  x#>',  42)  ~  V  ^xP'x^,  =  0.  (4.21) 

n-  1 

Thus,  from  Eqs.  (4.20)  and  (4.21),  it  follows  that 

x'^'z  =  0,  n  =  1,  2 . N  -  1.  (4.22) 

Since  x„  =  x^1*,  n  =  \,2,  ...  ,N  -  1,  we  have  shown  that  the  auxiliary  input  data  vectors  are 
orthogonal  to  the  output  vector. 
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V.  INVARIANT  TRANSFORMS  AND  GS  THEOREMS 

In  this  section,  we  discuss  two  types  of  matrix  transforms  on  the  input  data  that  significantly 
simplify  the  forthcoming  analysis.  In  addition,  two  theorems  related  to  concurrent  GS  processing  are 
presented.  Let  C  be  any  N  x  N  nonsingular  matrix.  It  is  well  known  [7]  that  transforming  the 
input  channels  xM,  xx,  . . . ,  xN_l  by  this  transform  does  not  change  the  transient  or  steady  state  per¬ 
formance  of  the  SMI  (or  GS)  canceller.  GS  cancellation  is  equivalent  to  a  specific  matrix  transforma¬ 
tion  of  the  input  channels.  For  the  GS  canceller,  the  transform  matrix  C  has  the  upper  triangular 
matrix  form.  Figure  4  shows  an  equivalent  configuration  of  a  GS  canceller  in  the  transient  state. 
Here  C  is  implemented  by  passing  the  input  channels  through  a  GSa  jv  structure  followed  by  a  power 
equalizer  on  the  output  auxiliary  channels.  The  output  powers  of  the  AUX  channels  after  power 
equalization  are  equal  to  <j£in.  Note  that  each  input  channel  into  the  GSK  N  structure  is  orthogonal  in 
the  steady  state  to  the  other  channels  and  that  all  input  channels  have  the  same  power  level,  a,;nrr 
Also,  without  loss  of  generality  we  can  define  a^in  =  1 . 


Fig.  4  —  Residue  equivalent  GS*  *  canceller 
using  the  power  equalizer  matrix 
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The  structure  shown  in  Fig.  4  illustrates  that  any  GS  canceller-structure  can  be  divided  into  two 
parts— a  deterministic  steady-state,  front-end  processor  whereby  the  main  channel  is  decorrelated  from 
the  auxiliary  channels  and  a  stochastic  back-end  processor  that  is  driven  by  uncorrelated  equal- 
powered  noise  in  each  channel.  The  back-end  processor  is  independent  of  the  input  covariance 
matrix,  and  the  auxiliary  weights  associated  with  the  back-end  processor  go  to  zero  as  K  — •  oo. 
Hence  the  convergence  properties  of  the  GS  canceller  can  be  studied  by  analyzing  the  convergence 
properties  of  the  back-end  processor.  Thus  from  this  point  on,  the  input  channels  are  assumed  to  be 
orthogonal  and  of  equal  power. 

A  second  matrix  transform  that  significantly  simplifies  the  forthcoming  analysis  is  now  dis¬ 
cussed.  Let  $  be  any  K  x  K  unitary  matrix,  i.e.,  <i>'^  =  IK.  Let  us  transform  each  input  channel 
noise  vector  x„,  n  =  0,  1,  2,  . . . ,  N  —  1  by  <t>  such  that 

x'n  =*xn,  n  =  0,  -  1,  (5.1) 

where  x  ,  n  =0,  1 ,  . . .  ,  N  -  1  is  the  resultant  output  noise  vector.  If  we  input  this  noise  vector 
into  a  GS K  N  canceller,  we  can  show  that  the  estimated  weights  using  the  x„  inputs  are  identical  to 
those  using  the  x'„  inputs.  This  is  easily  proved  by  substituting  x'„  as  given  by  Eq.  (5.1)  into  Eq. 
(4.7),  which  is  the  system  of  equations  that  solves  for  the  auxiliary  weights.  Because 

x"n  x'm  =  (4>x„)'  (<l>xm)  =  x'xm  ,  for  any  n,m  ,  (5.2) 

an  identical  system  of  linear  equations  results.  Thus,  the  estimated  weights  are  identical. 

One  simplification  that  readily  presents  itself  because  of  the  above  invariant  transform  pertains 
to  the  signal  representation  for  concurrent  processing.  Let  the  input  signal  be  represented  by  the  AT  - 
length  vector  s,  where 


s  =  (jj,  s2,  ,  SK)T .  (5.3) 

It  is  known  that  a  unitary  matrix  transform  4>v  exists  that  transforms  s  into  a  AT -length  vector  with  a 
nonzero  first  element  and  all  other  elements  equal  to  zero.  In  fact 

s  =  $jS  =  (Vs's,  0,  0,  . . . ,  0)r.  (5.4) 

Thus  we  need  only  consider  an  input  signal  vector  that  is  proportional  to  the  form  (1,  0,  . . . ,  0)r  and 

V?s. 


The  following  two  theorems  illustrate  the  proficiency  of  the  concurrent  GS  canceller  in  perform¬ 
ing  the  “cancellation  operation.” 

Theorem  1:  If 

N- 1 

=  E  C*Xn  -  (5-5) 

n  =  1 


then  the  GSK  N  structure  using  concurrent  processing  cancels  xM  exactly  regardless  of  the  number  of 
input  samples  K  per  channel. 
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Proof:  We  use  proof  by  induction.  First  we  prove  the  theorem  is  true  for  A'  =2;  then  we 
show  that  if  it  is  true  for  N  —  1 ,  it  is  also  true  for  N . 

For  N  =  2,  xM  =  c  tjc  | ,  let  w  be  the  decorrelation  weight  derived  from  K  input  samples  in 
each  channel,  i.e., 

E  (k)x0(k) 

w  =  —£■ - .  (5.6) 

E  l*i<*>!2 

* =i 

Substituting  xM  =  c,*,  into  Eq.  (5.6)  gives  w  =  c,,  and  therefore  xM  -  wx,  =  0.  Thus,  Theorem 
1  is  true  for  N  —  2. 

Assume  that  the  theorem  is  true  for  all  integers  less  than  N .  Now  x„(2),  n  =0,  1 . N  -  2 

are  the  outputs  from  the  first  level  of  GS*  ^  as  illustrated  in  Fig.  1(a).  That  is 

xn2)  =  *n(1)  -  tvn(1)^-i  .  n  =  0,  1 . N  -  2  (5.7) 

where 


w!"  = 


Exf,'l\(k)x<l\k) 

k  =  l _ 

E  l4-i(*)l2 

k  =1 


(5.8) 


For  substituting  Eq.  (5.5)  into  Eq.  (5.8)  gives 


Hence, 


4”  =  N£chw}"  +  C„_,. 
n  =  1 


N-2 


(5.9) 


42)  =41)  -  W’<W-1  =  Ecn^<l)  +  CN  - 

n  =  1 


N-2 

E  Cn  wn  +  CV-I 

n  =  1 


tf'i. . 


=  e2c„  L(i)  -  ] , 

n  =  1  V  J 


N- 2 

E  C.X®. 

n  =  1 


(5.10) 


Figure  5  illustrates  the  form  of  the  output  after  first-level  processing.  Note  that  .r^2'.  ,v[2' . 

4*2  are  the  inputs  to  a  GS*  N_,  structure.  Because  Xq2)  is  a  linear  sum  of  a}2',  ,r(2' . as 

shown  by  Eq.  (5.10),  and  the  theorem  holds  for  A  -  I,  then  x^2)  is  exactly  cancelled  independent  of 
K .  Hence  the  theorem  holds  for  N  and  the  theorem  is  proved  by  induction. 
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Fig.  5  —  Concurrent  GS  canceller  after 
first-level  processing 

A  theorem  that  follows  as  a  result  of  Theorem  1  is 
Theorem  2:  If 


H- 1 

*M  =  L  cnxn  +  e,  (5.11) 

n  =1 

then  the  output  noise  zcw  GSk<n  structure  using  concurrent  processing  depends  only  on  e,  i.  e. ,  zov  is 
N  - 1 

independent  of  £  cn^ c„. 

n  =  1 


Proof:  Let  us  write 


xm  ~  -*oi  +  -*02  *  (5.12) 

where 

N-  1 

-*01  X*  -*02  &  •  (5.13) 

n  ~  1 


From  Fig.  1(a),  all  the  w„(m),  n  >  1  are  independent  of  xM,  x0l,  or  x02,  i.e.,  all  computed  weights  to 
the  right  of  the  main  channel  do  not  depend  on  the  main  channel  inputs. 
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For  H'^1)  (the  first  level,  main  channel  weight). 


£  44  *(*)[■ x$(k)  +x${k) ] 

k=  1 _ 

£  I4-.(*)I2 

*=1 


=  ^5>+w^,  (5.14) 


where 

£  44  *<*)4i>(*) 

41*  =  * £ - ;  (5.15a) 

E  I44(*)l2 

*  =  i 


£44*(*)4J>(*) 

=  -^7 - ■  (5.15b) 

E  I44(*)l2 

Thus 

42)  =  4P  +  x$  -  (41*  +  44  44  . 


=  -  woi’  4-i ]  +  Uo2)  -  wm  44 1» 


=  x$  +  x$ ,  •  (5.16) 

where 


42)  =  4P  "  4!’  4-1 1  x$  =  4^’  -  ^4-1  •  (5-17) 

Note  that  the  outputs  xffl  2nd  x$  could  have  been  computed  independently  of  each  other.  Each 
output  depends  only  on  its  respective  input,  *01  or  x02.  We  can  continue  down  the  main  channel, 
computing  the  outputs  of  each  successive  two-input  GS  as  the  sum  of  the  residues  that  result  from 
components  of  the  input  sum. 

As  a  result,  an  equivalent  GS  canceller  for  a  main  channel  that  is  the  sum  of  two  components  is 
shown  in  Fig.  6.  Here  each  individual  GS^  N  structure  is  computed  with  respect  to  its  main  channel 
input  with  the  outputs  at  each  GSK  N  summed  to  form  the  resultant.  Hence,  because  of  Theorem  1, 
*01  is  nulled  completely  so  that  the  resultant  depends  only  on  x02  or  e .  Thus,  we  need  consider  only 
the  independent  noise  term  of  xM  when  computing  the  output  noise  power. 
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Fig.  6  —  Equivalent  concurrent  GS  canceller  for  main  channel  that 
is  the  sum  of  two  components 

VI.  THE  TWO-INPUT  GS  CANCELLER 

The  basis  for  understanding  the  convergence  properties  of  a  GS  canceller  begins  with  studying 
the  two-input  GS  canceller  illustrated  in  Fig.  7  where  from  the  discussion  in  the  previous  section,  jc0 
and  jc |  are  r.v.’s  that  are  assumed  to  be  equal  powered  and  uncorrelated.  Let 

^cw  (^ov(3X  %cw  (2)»  •  •  -  9 

be  the  concurrent  output  noise  residue  vector  and  Z„w  be  the  nonconcurrent  output  noise  residue.  For 
a  two-input  GS  canceller 


t 


i 


where 


^ov  W  Xj  , 

=  Xq  -  IV  X|, 


*i*o 

w  =  - . 

x{x, 


Furthermore,  the  transient  output  noise  powers  are  given  by 


and 


K 


1 


Kol 


*0*0  - 


[*?  *ol2 
x{x, 


(6.1) 

(6.2) 

(6.3) 


(6.4) 


1  +  |  w  |  2 


1  + 


1  xf  Xq  1  2 

(x{x,)2 


(6.5) 
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(a)  (b) 

Fig.  7  —  Two-input  GS  cancellers 

In  Appendix  A  (and  Ref.  8),  it  is  shown  under  Assumptions  1  to  6  given  in  Section  II  that 
b„w  =  r;  has  the  following  probability  density  function. 


P(V)  =  ^  +  I  ,  V  Z  1- 

(6.6) 

Now  the  transient  SNR  is  equal  to  the  reciprocal  of  a„H,.  Thus  if  SNR  = 
of  elementary  probability  theory. 

1  /on2w  =  p,  then  by  the  use 

P(P)  =  KpK~\  0  <  p  <  1. 

(6.7) 

By  using  Eqs.  (6.6)  and  (6.7),  it  can  be  shown  that 

°L  (K,  2)  -  1  +  K  !  > 

(6.8) 

SNRnw.  (K,2)  =  KK+  r 

(6.9) 

and 

SNR<i>(tf,2)  = 

(6.10) 

From  Eq.  (6.4),  the  expected  value  of  a}.w  conditioned  on  x,  is  given  by 


E\K 


x,  = 


^^min 


£|XqXo)  “ 


xjElXoX^lx, 


xfx, 


(6.11) 
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By  assumption  flx^}  =  amiJi<  »  where  IK  is  the  K  x  K  identity  matrix  and  £{XoXo)  =  Ko^ in. 
Thus  Eq.  (6.11)  reduces  to 


z-i&ilx,}  =  1  -  J. 

Note  that  this  expression  is  independent  of  X!  so  that 

wLi  =  i  -  j 

or 

2)  =  1  -  j. 


(6.12) 


(6.13) 


(6.14) 


Also  note  the  assumption  that  the  inputs  are  Gaussian  r.v.’s  was  not  used  in  this  derivation.  In  fact 
each  data  point  in  either  channel  can  have  any  p.d.f.  so  long  as  it  has  a  zero  mean  and  identical  vari¬ 
ances. 

The  unnormalized  output  signal  vector  through  the  two-input  GS  canceller  is  given  by  the 
expression 


s 


ft  _ 


*i*i 

Xj'x, 


s. 


Hence  the  sample  average  output  signal  power  is  given  by 


K 


r 


(6.15) 


(6.16) 


We  show  in  Section  V  that  s  and  X!  can  be  transformed  by  a  unitary  matrix  without  affecting  the 
resultant  output  measures.  If  we  transform  the  s  and  Xj  by  the  unitary  matrix  defined  in  Section 
V,  then  Eq.  (6.16)  reduces  to 


Now  the  normalized  signal  power  is  given  by 


l*i(Qi2 

£  i*.(*)i2 

/t  =  ! 


(6.17) 
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SO  that 


U'l2  =  l 


E 

*= i 


(6.19) 


It  is  straightforward  to  show  that  if  x,  is  a  normalized  A" -length,  multivariate,  complex  circular  Gaus¬ 
sian  vector,  then  £  =  |  s '  | 2  has  the  following  p.d.f. 


/>(£)  =  (K  -  \)f~2  ,  £  >  0. 


(6.20) 


Furthermore,  the  moments  of  |  s '  | 2  can  be  found  by  using  the  above  p.d.f.  and  are  given  by 


s“J  (K,  2)  = 


K  -  1 
K  -  1  +  / 


From  Eq.  (6.21)  it  follows  that 


sm(K,  2)  =  1  -  — . 


Finally,  we  can  show  that 


NSRrw  = 


—  (xfo  -  |x(xo|2/x{x,) 


1  - 


l*i(D| 


E  l*.(*>l 

* =i 


If  NSRCW  is  averaged  over  Xo,  then 


£(NSRw|x,|  = 


K- 1 


I  - 


l*iO)l 


E  l*i(*)l 

(t=i 


K- 1 


1  + 


.(i)!2 


E  Ui(*)l 

k  =2 


Now  since  X|  is  a  normalized  K -length,  multivariate,  complex  circular  Gaussian  vector, 


E  (  |jc,(1)|2)  =  1  and  E 


1 


E 

K=  2 


K-2 


(6.21) 


(6.22) 


(6.23) 


(6.24) 
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Thus 


NSRW  (K,  2)  =  EfNSR^j  = 


K  -  1 


1  + 


K-2 


(K  -  l)z 
K(K  -  2) 


(6.25) 


VII.  GENERAL  MOMENT  THEOREM  FOR  GS  CANCELLERS 

Let  AK  N  denote  a  transient  unnormalized  moment  of  an  output  measure  (output  noise  power 
residue  or  SNR)  associated  with  a  concurrent  or  nonconcurrent  GS  canceller  with  N  input  channels 
and  K  independent  samples  per  channel.  Define  the  normalized  average  transient  moment  as 


A(K,N)  = 


1 

lim  E\AK  N\ 

K  —  oo 


(7.1) 


where  A(K,  1.  In  this  section  we  prove  the  following  theorem: 

General  Moment  Theorem  for  GS  Cancellers:  If  assumptions  1  to  6  hold,  then 

A(K,N)  =  A(K  ,2)  A(K  -  1,  N  -  1) 

or  equivalently 


(7.2) 


K 

A(K,N)  =  n  A{k,2).  (7.3) 

k=K-N+ 2 

Proof.  We  prove  this  by  mathematical  induction.  First,  the  theorem  is  obviously  true  for  N  =2. 
Thus,  we  can  assume  that  the  theorem  is  true  for  all  integers  less  than  or  equal  to  some  upper  bound: 
N  —  1.  We  can  then  show  that  it  is  true  for  V,  which  implies  that  it  is  true  for  any  N  >  2. 

Again  assume  that  all  input  channels  are  of  equal  power  and  uncorrelated.  It  is  shown  in  Ref.  7 
and  discussed  in  Section  V  that  this  assumption  does  not  change  the  output  measures.  A  GSK  N  struc¬ 
ture  can  be  decomposed  as  shown  in  Fig.  8  into  a  first-level  processor  followed  by  a  GSK  N  _ ,  struc¬ 
ture.  The  output  data  AC -length  vectors  of  the  first-level  processor  can  be  written  as 


y n  xn  ’ 


or 


w 


xV-)Xn 
X‘S  [  X  v  i 


n  =  0,  1 . N  -  2 


(7.4) 


or 


XV  -  1  Xn 

y  n  —  xn  ,  x*-l 

XV - 1XN - 1 


y„ 


- 


XV-IXV-I 


'V  -  1  -  1 


n  =  0,  1,  2 . N  -  1 . 


(7.5) 


21 


GERLACH  AND  KRETSCHMER,  JR. 


X0  X1  X2  XN-2  XN  -  1 


It  can  be  shown  that 


XN-|X^-t 

xN-lxN-l 


4>'A<I>, 


(7.6) 


where  is  a  K  x  K  unitary  matrix  and  A  is  a  diagonal  matrix  where  the  first  element  is  0  and  all 
other  diagonal  elements  are  equal  to  1 .  Thus 

y„  =  A  $xn  ,  n  =  0,  1,  . . .  ,  A  -  2 .  (7.7) 

As  shown  in  Section  V,  the  output  data  set  y„ ,  n  =  0,  1,  . . . ,  N  -  2  can  be  transformed  by  a  uni¬ 
tary  matrix  $  and  not  change  the  equivalent  transient  weighting  vector  of  the  GSK  N^]  structure. 
Thus 


un  =  <t»y„  =  A  <hx„  ,  n  —  0,  ,  N  — 2.  (7.8) 

Now  set  v„  =  4>x„.  Because  x„  is  a  normalized  /(-length  multivariate  complex  circular  Gaussian 
vector,  then  v  is  the  same.  As  a  result,  using  the  form  of  A  and  setting  u„  =  (unl,  un2,  . . .  ,  u^)T , 
v„  =  (vn  i ,  vn2,  . . .  ,  v„*)r,  it  follows  from  Eq.  (7.8)  that 

«-i=0  (7.9) 

and 


u„k  =  vnk  ,  k  =  2,  3 . K 

Hence,  the  input  r.v.'s  to  the  GSK  N_]  structure  are  identically  distributed  to  the  input  r.v.’s  to  the 
GS*jv  structure  except  that  their  number  has  been  reduced  by  one  as  illustrated  in  Fig.  9. 
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Fig.  9  —  Further  decomposition  of  GSK 


Consider  the  implications  of  this  new  structure  by  using  concurrent  processing.  Let  BK_{ 
denote  a  transient  unnormalized  moment  of  any  output  measure  (output  noise  residue  or  SNR)  associ¬ 
ated  with  the  GS^_!  N_x  structure.  Note  that  AKN  =  BK_ \  s-\-  Then  according  to  the  General 
Moment  Theorem, 


I xiV-i1 

lim  E\Bk  _i  |  x^v - 1 1 


=  A{K  -  1,  N  -  1). 


(7.10) 


Note  in  the  limit  taken  above  that  K  goes  to  infinity  only  in  the  GS^_(  N_]  structure  and  not  in  the 
first-level  processor  that  precedes  the  GS*_j  structure.  Thus 

E\^k,n  !  -i)  =  A{K  —  1,  N  —  1)  lim  E\Bx~i .yv  —  j  1  Xat-i)  (7.11) 

K  —  oo 

or 

E{Ak'N\  =A(K  -  1  ,N  -  1)  lim  ffB*-!.*-!)  •  (7.12) 

A'—  00 


The  limit  in  Eq.  (7.12)  can  be  evaluated  simply.  It  is  equal  to  the  associated  moment  of  the 
output  measure  coming  out  of  the  first-level  processor  in  the  main  channel  or 

lim  {B/f_i,y_|j  =A(K, 2)  lim  (7.13) 

K  —  oo  K  —  oo 
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Now,  the  limit  on  the  right-hand  side  of  Eq.  (7.13)  is  equal  to  a^jn  =  1.  Hence  the  theorem  is 
proved  for  concurrent  processing. 

We  now  prove  the  theorem  for  nonconcurrent  processing.  The  nonconcurrent  data  sample  after 
first-level  processing  can  be  written  as 

Y„  =  Xn  -  wnXN_y,  n  =0,1 . N  —  l ,  (7.14) 


where  wn  is  given  in  Eq.  (7.4).  Given  \N _ ,  and  it  can  be  shown  that  the  Yn, 

n  =  0,  1,  ....  N  -  2  are  uncorrelated  complex  Gaussian  random  variables,  i.e.,  E{Yn  Y* J  =0  and 
E{  |  Y„  | 2)  equals  a  constant.  Thus  the  Yn ,  n  =  0,  1 ,  . . . ,  N  -  2  conditioned  on  and  XN_,  are 
identically  distributed  as  the  Xn ,  n  =  0,  1 ,  . . . ,  N  -  1 .  Also  note  that  the  4>  transformation  does 
not  change  the  equivalent  linear  weighting  of  the  GS processor  or  its  equivalent  GS^,,  A>_,  pro¬ 
cessor.  Hence,  the  inputs  (concurrent  and  nonconcurrent)  into  the  GS^_1N_[  processor  are  identi¬ 
cally  distributed  as  those  into  the  GSK  N  processor. 


Defining  BK^X  N_{  as  before,  it  follows  from  the  General  Moment  Theorem  that 


|  Xjy-i,  -Yjy-i) 

lim  E[BK_l  N_l  |  xN_j,  XN_ij 

K^OO 


=  A(K  -  1,  N  -  1). 


(7.15) 


By  using  Eq.  (7.15)  and  reasoning  similar  to  that  used  for  proving  the  theorem  for  concurrent  pro¬ 
cessing,  the  theorem  follows  for  nonconcurrent  processing. 


VIII.  CONVERGENCE  RESULTS 

In  this  section,  the  General  Moment  Theorem  for  GS  Cancellers  (Eq.  (7.3))  is  used  to  derive  a 
number  of  results.  Most  of  these  results  are  demonstrated  in  Refs.  7  and  8. 

Employing  the  General  Moment  Theorem  under  Assumptions  1  to  6  and  the  expressions  for 
<&(*. 2),  sw(K, 2),  sc(l\K,2),  NSRW(A\2),  a2nw{K,l),  SNR^ff  ,2),  and  SNR^(K,2)  given  in 
Section  IV,  we  can  show  that 


j2  ( 
JCW\ 


scw(K,N)  =  1  - 


sg)  (K,N)  = 


NSR  CW(K,N)  = 


anw(K  ,N)  = 


1  i* 

1 

(8.1) 

1 

K  ’ 

(8.2) 

1)!  {K  +  i  -  N)\ 

(8.3) 

N)\  (K  +  i  —  1)!  ’ 

-  1)  {K  -  N  +  1) 

K  (K  -  N) 

(8.4) 

K 

K  -  N  +  1  ’ 

(8.5) 
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SNRniv(tf,/V)  =  -  — 


(8.6) 


SNRi'j  (K,N) 


K\  {K  +  i  -  N  +  1)! 
(K  -  N  +  1)!  ( K  +  /)!  ‘ 


(8.7) 


Equations  (8.1)  to  (8.3)  and  (8.5)  to  (8.7)  are  given  in  Refs.  7  and  8. 

Equation  (8.7)  can  be  used  in  ad  hoc  fashion  to  find  the  p.d.f  of  SNRnH,  for  any  K  and  N .  If 
p  =  SNR,,*,,  then  the  p.d.f.  that  yields  moments  as  given  by  Eq.  (8.7)  is 


P(P)  (N  -  2)\ 


(8.8) 


(* 


From  Eq.  (8.8),  the  p.d.f.  of  can  be  obtained.  Let  tj  =  a„w  =  1/p.  It  is  straightforward  to 
show  that 


n(  ,  = _ K\ _  (n  ~  D""2 

PW  (N  -  2)1  (K  -  N  +  1)!  vK  +  l 


1  <  1J  <  00  . 


(8.9) 


Note  that  Eqs.  (8.8)  and  (8.9)  were  first  derived  in  Refs.  7  and  8,  respectively. 


IX.  THE  CONCURRENT  GS  CANCELLER 

The  discussion  and  results  in  this  section  pertain  specifically  to  the  concurrent  GS  canceller. 

A.  Input-Output  Matrix  Transform 

Consider  the  GS  canceller  with  all  of  its  output  channels  as  shown  in  Fig.  2(b).  The  K -length 
input  channel  vectors  are  transformed  into  /(-length  orthogonal  output  vectors  that  form  an  orthogonal 
basis  for  the  N  -  1  auxiliary  input  vectors,  i.e.,  if  z„  =  (z„(l),  z„(2),  . . . ,  z„(/())r, 

n  =  0,  1 . N  —  1  are  the  output  channel  vectors,  then 


=  °’  "1  *  n2- 


(9.1) 


It  is  elementary  to  show  that  the  main  channel  output  vector  at  the  n  +  1th  level,  x^"  +  1\  is 
given  by  the  expression 


x<5n+1)  = 


,  zN-nzN-n 

‘k  ; 

zN-nzN-n 


xT- 


(9.2) 


Thus  because  zfW  =  Zq  =  \^N)  and  x<)  = 

=  x^'\  it  follows  that 

N- 1 

n 

r  'n 

f  ZnZn 

lK  t 

n  =  1 

znzn 

Xo- 


(9.3) 
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Equation  (9.3)  can  be  simplified  by  using  Eq.  (9.1)  as 


z 


cw 


Ik  -  E 


*->  2„Z' 


n  =  1 


*0- 


(9.4) 


Set 


!k  ~  L 


N~ 1  z„zi 


n  =  1  2n  Zn 


(9.5) 


The  K  x  K  matrix  G  is  the  input-output  matrix  transform.  In  addition,  Gx„  =  0, 

n  —  1,  2 . N  —  1.  Thus,  the  rows  of  G  have  an  orthonormal  basis  that  is  orthogonal  to  the 

orthonormal  basis  of  n  =  1,  2,  . . . ,  N  —  1.  If  we  write  the  auxiliary  input  samples  as  a 
K  X  (N  -  1)  matrix,  Xaux,  then  the  orthonormal  basis  of  the  rows  of  G  is  the  same  as  the  orthonor¬ 
mal  basis  for  the  null  space  of  C.  A  matrix  having  the  form  given  by  Eq.  (9.5)  is  sometimes  called  a 
complementary  projection  matrix.  Furthermore  G  =  G1 ,  G  is  idempotent  so  that 


G2  =  G. 


(9.6) 


B.  Sample  Average  Bound 

We  use  Eq.  (9.4)  to  prove  the  following: 

Output  Power  Sample  Average  Theorem:  For  concurrent  GS  cancellers,  the  sample  average  of  the 
output  power  residue  is  always  less  than  or  equal  to  the  sample  average  of  the  input  power. 

Proof:  The  input  and  output  power  sample  averages  are  given  by 

=  J  *0*0  (9’7> 


-2  J_  < 


(9.8) 


By  use  of  Eqs.  (9.4)  and  (9.6), 


—  (Gxo/Gxo 


=  XoGxo 

i  ,  i  v 

=  -p  *o*o  -  -p  L 

K  *  n=  1 


I  z'Xpl 2 


=  if,  --f  £ 
K  »=1 


1  N~'  1^*0 1 


Since  the  summation  term  in  Eq.  (9.9)  is  always  positive,  the  theorem  follows. 


(9.9) 
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C.  P.D.F.  of  the  Output  Noise 

Define  An  to  be  a  K  x  K  diagonal  matrix  with  0’s  in  the  first  n  diagonal  elements  and  1  ’s  in 
the  rest.  It  is  straightforward  to  show  that  G  can  be  written  in  the  form: 

G  =  &  Ajv_!  $,  (9.10) 

where  $  is  a  K  X  K  unitary  matrix.  Thus 

°L  =  A*  _,#*<>  (9.11) 

=  J  ($*0)'  (4>Xo). 

We  set  a  K -length  vector,  u  =  4>xo.  If  xq  is  a  normalized  AT -multivariate  complex  Gaussian  circular 
process,  it  can  be  shown  that  u  is  the  same.  Because  of  the  form  of 

^  =  4  E  \uk\2-  (912) 

A  k=N 

It  is  elementary  to  show  that  K has  a  K  -  N  +  1  order  chi-square  p.d.f.  Thus  if  ij  =  &^,  then 

tsK~N  +  \ 

P(V)  =  lK-~Ny  VK'N  e~K\  r,>0.  (9.13) 

Note  that  the  output  noise  power  p.d.f.  is  independent  of  the  auxiliary  p.d.f.’s  and  their  independence 
properties.  Only  the  main  channel  must  be  Gaussian,  and  its  samples  must  be  independent.  Equation 
(9.13)  is  the  same  p.d.f.  that  was  derived  by  Brennan  and  Reed  in  Ref.  8  under  the  assumption  that 
all  input  channels  are  Gaussian. 

Hence  the  following  theorem  results: 

Concurrent  GS  Canceller  Convergence  Theorem:  If  the  main  channel  of  an  N -input  GS  canceller 
consists  of  a  K  -length  multivariate  complex  circular  Gaussian  vector  of  samples  that  are  also  indepen¬ 
dent  of  the  non-time-coincident  samples  of  the  auxiliary  channels  (Assumption  3,  Section  II)  and  the 
auxiliary  channels  have  arbitrary  p.d.f  s,  autocorrelation  Junctions,  and  cross-correlation  functions 
with  other  auxiliary  channels,  then  the  normalized  output  noise  power  residue  t>  =  has  a  p.d.f 
given  by  Eq.  (9. 13). 

D.  P.D.F.  and  Power  of  the  Signal 

The  input  signal  vector  s  is  also  transformed  by  the  input-output  matrix  transform  in  the  GS 
structure.  It  is  shown  in  Section  VIII  that  the  moments  of  the  output  signal  power,  |  s  ’  | 2,  are  given 
by  Eq.  (8.3).  It  can  be  shown  ad  hoc  that  if  the  inputs  satisfy  Assumptions  1  to  6,  then  the  p.d.f.  of 
p  =  |  s ' ) 2  is  given  by 
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P(P)  = 


C K  -  D? 


— r  (1  -  p)N-2pK~N,  0  <  p  <  1 . 


(K  -  N)l  ( N  -  2)1 

Again  this  is  the  same  result  that  was  derived  in  Ref.  8  although  the  signal  model  was  different 


(9.14) 


The  input  signal  vector  s  is  transformed  by  the  projection  matrix  G  defined  by  Eq.  (9.5)  into  the 
unnormalized  signal  vector  s',  given  by  the  expression 


s'  = 


N~l  Z„Z l 


4  -  E  -7 


n  =  1  z/izn 


S. 


(9.15) 


Again  without  loss  of  generality,  we  can  set  s  =  (os,  0,  0,  . . .  ,  0)r,  where  a}  =  s's  is  the  sum  of 
the  input  signal  power  across  K  samples.  Thus  by  using  Eq.  (9.15), 


sit e'  —  ,2  |  T 

s  s  —  as  i0 


4  -  E 


ZnZl 


n  =  1  znzn 

where  10  =  (1,  0,  0,  . . .  ,  0)r.  Equation  (9.16)  simplifies  to 


1  T 

xo  * 


(9.16) 


s'^s' 


_  V  lz"(1)l2 

2  /  j  g 

1  n=1  E  lz»(*)l 

* = i 


(9.17) 


Now 


Scw(K,N)  =  E 


f  s'V 

N-l 

=  1-  E* 

U„d)l2 

1  n2 

K 

l  - 

■  - 1 

E  U«<*)l2 

*=i 

N-  J 

(9.18) 


If  the  elements  of  jc,,  x2 ,  . . . ,  xN_x  are  identically  distributed  r.v.’s,  then  the  elements  of  z,,  z2  . . . , 
Zn-i  are  also  identically  distributed  r.v.’s.  It  then  follows  that  independent  of  the  p.d.f.’s  of  these 
r.v.’s. 


E 


Und)!2 

E  I  *»<*)! 2 

*=1 

v.  J 


K  ' 


(9-19) 


Thus  we  have  the  theorem: 

Concurrent  GS  Canceller  Signal  Theorem:  If  the  samples  of  the  auxiliary  channels  are  identically  dis¬ 
tributed  r.  v.  ’s  and  the  desired  signal  is  only  in  the  main  channel,  then 

Scw(K,N)  =  l  -  (9-20) 
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E.  General  Result  for  Noise  Power 
We  prove  the  following: 

Concurrent  Processing  Theorem:  Let  an  input  data  element  in  the  vectors  Xq,  xt  . . .  ,  xw_j  have  any 
p.d.f.  (note  each  of  the  KN  elements  could  have  a  different  p.d.f).  Then  under  Assumptions  2  to  6, 
Section  II, 


al  ( K,N )  =  1  - 


N  -  1 


(9.21) 


Proof:  Consider  the  form  of  a2*,  given  by  Eq.  (9.9).  We  can  show 


E{bcw  |  z,,  z2,  . . . ,  z„_,j  =  E{ain)  -  —  £ 

A  n- 1 


1  N~l  ZnE {*bXoK 


z>n 


(9.22) 


Now  E\b}„ )  =  1  and  Elx^j  =  IK.  Hence 


I  zi>  z2>  •  •  •  >  zv-i)  —  1  ~ 


N  -  1 


(9.23) 


The  theorem  follows  by  integrating  both  sides  of  Eq.  (9.23)  over  the  joint  p.d.f.  of  z2,  . . .  ,  Zjv-j. 


F.  Noise-to-Signal  Considerations 

We  show  in  Section  VIII  that  the  average  NSR  is  given  by  the  expression  (repeated  here) 


NSR(K,N) 


(K  -  1)  (K  -  N  +  1) 
K  (K  -  N) 


(9-24) 


The  number  of  samples  where  the  NSR  is  within  3  dB  of  the  optimum,  is  found  by  setting 

NSR  ( K,N )  equal  to  2  and  solving  for  £3dB.  It  is  found  that 


N  +  V(V2  +  4  (V  -  1) 

K  3dB  =  - 2 - •  (9-25) 

However,  note  that  for  K  —  N  +  1,  N  +  2,  N  +  3,  N  +  4,  N  +5, 

NSR  (N  +  1,  N)  =  2  -  —3—’ 

N  +  l 

NSR  (N  +2 ,N)  =  1.5  - 

N  +  2 

NSR  (N  +  3,  AO  =  1.33  -  ,  (9.26) 
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NSR  (N  +4 ,N)  =  1.25  -  , 

N  +  4 


NSR  (N  +  5,N)  =  1.2  -  — 

N  +  5 

Hence,  for  K  —  N  +  1,  the  NSR  is  already  within  3  dB  of  the  optimum  and  for  K  =  N  +  5  is 
within  0.8  dB. 

G.  Discussion 

The  concurrent  GS  canceller  converges  rapidly  as  implied  by  Eq.  (9.18).  In  fact  it  requires 
approximately  N  samples  to  achieve  the  3  dB  performance  point,  opposed  to  2N  samples  required  for 
the  nonconcurrent  processing  [7].  However,  other  losses  must  be  taken  into  account.  These  losses 
are  best  exemplified  by  considering  the  transformed  output  noise  and  output  signal  vectors  after  GS 
cancellation.  These  can  be  written  as 

s'  =  (0,  0, . . . ,  0,  s  'n,  s  '/v  +  i,  . . . ,  s ' fc)T  ,  (9.27) 


n'  =  (0,  0,  ... ,  0,  n'N,  n'N  +  x,  ... ,  n'K ), 


(9.28) 


where  the  first  N  —  1  elements  of  each  vector  are  equal  to  zero.  Signal  detection  losses  occur 
because  we  have  lost  N  -  1  independent  samples  of  signal  and  noise  by  going  through  the  GS  can¬ 
celler.  Hence,  the  signal  detector  after  the  GS  canceller  whether  it  be  a  coherent  integrator  or  some 
other  detector  scheme  (hypothesis  test  statistic  generation  and  thresholding)  has  lost  independent  data 
samples,  which  decreases  the  probability  of  detection  of  the  signal.  Thus  even  though  K  -  N  +  1 
samples  yields  an  NSR  per  time  sample  that  is  within  3  dB  of  the  optimal  steady  state  ( K  —  oo),  the 
output  samples  residue  are  so  correlated  that  they  are  equivalent  to  having  only  two  independent  sam¬ 
ples  out  of  the  N  +  1  output  samples.  As  a  result,  signal  detection  after  GS  cancellation  can  degrade 
significantly. 

X.  LOWER  BOUND 

In  this  section,  we  derive  a  lower  bound  associated  with  convergence  of  a  nonconcurrent  GS 
canceller  when  the  input  data  are  not  necessarily  Gaussian.  In  the  analysis,  Assumptions  2  to  6  of 
Section  II  hold.  An  element  of  the  input  data  vectors  Xq,  x},  . . . ,  xiV_)  can  have  any  p.d.f. 

A  result  for  the  two-input  GS  canceller  is  first  established.  For  this  case,  it  was  previously 
shown  that 


» 2  ,2  _  ,  A  xfXpX^X, 

1  "b  |  |  1+  ,  f  j 


(*]Xir 


We  can  show  that  because  £{xoXo)  =  OmJic  > 


E\°nw  I  x,)  =  1  + 


xfx, 


(10.1) 


(10.2) 
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Thus 


oL(K,  2) 


E[o2nw)  =  1  +  •  £ 


(10.3) 


Appendix  B  shows  that  if  z  is  any  r.v.  with  a  nonzero  mean,  then 


(10.4) 


Applying  this  inequality  to  Eq.  (10.3)  results  in  the  inequality 

°L(K,  2)  £  1  +  j. 


(10.5) 


We  now  state  the  following  theorem: 

Nonconcurrent  Processing  Theorem:  Let  an  input  data  element  in  the  vectors  x<J,  x, . xn-\  have 

any  p.d.f  Then  under  Assumptions  2  to  5,  Section  II, 

o2nw(K,N)  >  ■  (10.6) 

Proof:  The  proof  of  this  theorem  is  by  induction  and  again  closely  follows  the  proof  of  the  General 
Moment  Theorem.  We  have  shown  that  it  is  true  for  N  =2  (Eq.  (10.5)).  Thus  the  theorem  is  true 
for  all  integers  less  than  or  equal  to  some  upper  bound:  N  -  1.  We  show  this  is  true  for  any  N, 
which  implies  that  it  is  true  for  any  N  >  2. 

Again  the  GS*,/v  processor  is  decomposed  as  shown  in  Fig.  8  and  further  reduced  as  shown  in 
Fig.  9.  Neither  this  decomposition  nor  reduction  depend  on  the  p.d.f. ’s  of  the  input  data.  Also  the 
concurrent  data  entering  the  GS*_i  processor  satisfy  Assumptions  2  to  6.  In  addition,  the  non¬ 
concurrent  data  conditioned  on  \N  - 1  and  Xn-i  satisfy  Assumptions  2  to  6. 

2 

Thus  if  /v-i  is  the  transient  unnormalized  output  noise  power  residue  using  nonconcurrent 
processing  of  the  GSK_  \  canceller  where  d%N  =  a|_(  ,V-i  and  if  the  Nonconcurrent  Processing 
Theorem  is  true  for  N  -  1 ,  then 

£(<rx-i,N-i  I  XN-1>  K  _  „ 

- 7 -  s  - r, - ~  -  (10.7) 

Jim  fj&tf-i./v-i  |  Xyv-i,  •Xa'-i)  ^  N  +  2 

K  —  oo 


Again  note  that  in  the  limit  taken  above,  K  goes  to  infinity  only  in  the  GSa:_1n_,  structure  and  not 
in  the  first-level  processor  that  precedes  the  GSK_l  N_}  structure. 

From  Eq.  (10.7),  it  follows  that 

vt  —  m  ,  o  '  J*m  (10.8) 

K  —  N  +  2  X- * 
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We  know  that 


lim  E[o2k_ ijv-i)  =  °L(K> 2)  >  1  +  -p.  (10.9) 

Substituting  Eq.  (10.9)  into  Eq.  (10.8)  results  in  the  theorem  being  proved. 

Note  that  if  the  input  noises  are  Gaussian,  then  the  lower  bound  given  by  Eq.  (10.6)  is  almost 
achieved.  Hence,  this  assumption  results  in  almost  the  “best  case”  performance. 

XI.  SLIDING  WINDOW  GS  CANCELLER  CONVERGENCE 

The  sliding  window  GS  canceller  differs  from  the  block  processing  schemes  previously 
described  in  that  data  are  processed  on  a  sample  point  by  sample  point  basis  rather  than  in  complete 
blocks.  For  the  sliding  window  GS  canceller,  the  GS  weights  are  estimated  every  time  step.  Thus, 
the  sliding  window  canceller  tends  to  adapt  to  a  nonstationary  noise  environment  better  than  a  block 
processor  canceller  does.  The  weights  that  are  calculated  in  the  GS  structure  are  based  on  a  fixed 
number  of  past  samples  from  a  given  point  in  time.  When  the  sampler  steps  one  time  interval,  the 
newest  sample  is  included  in  this  estimate  and  the  oldest  sample  is  discarded. 

The  sliding  window  GS  canceller  is  described  mathematically  as  follows.  For  any  indexed  time 
instant  j,  set 


*oO)  ^raO  1)>  •  •  •  Xm(j  K  1)]  > 

*»(/)  =  [x„(j),xnU  -  1),..  .,xn(j  -  K  +  l)lr,  n  =  1,2 . N  -  1  (11.1) 


where  K  is  the  number  of  samples  used  to  calculate  each  of  the  GS  weights.  From  Fig.  1(a),  the 
input  vectors  into  the  m  th  level  of  the  GS  structure  are  defined  as 


4m)(j)  =  lxn(m)<j),  -  1),  •  •  • ,  *„<m)(/'  —  K  +  l)f  ,  (11.2) 

where  =  x„(j)  and  =  x„(j).  The  outputs  of  the  two-input  GSs  at  the  mth  level  are 

given  by 


where 


(m  +  l) 
■*n 


O') 


=  X„(m)(j)  -  **”>(/)  x#L>mV), 


n  =  0, 1,. . . ,  N  —  m  —  1 
m  =  1,2,...  ,  N  -  1 


xn,0)  <m)(j) 

xfr-mU)  X/F-mU) 


(11.3) 


(11.4) 


We  state  and  prove  the  following  theorem: 


Sliding  Window  GS  Canceller  Theorem:  If  the  input  samples  are  identically  distributed  r.v.  ’s,  then 
the  p.d.f.  's  of  a  given  transient  performance  measure  (output  noise  power  residue  or  signal-to-noise 
ratio)  of  the  sliding  window  GS  canceller  and  the  concurrent  GS  canceller  are  identical.  Hence  alt 
average  transient  performance  measures  are  equal  for  the  two  processing  schemes. 
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Proof:  The  proof  of  the  above  theorem  is  rather  simple.  Firstly,  the  output  sequence  of  the  sliding 
window  GS  canceller  is  identical  to  that  of  an  output  sequence  generated  by  using  successive  con¬ 
current  block  processing  and  retaining  only  the  last  output  data  sample.  To  see  this,  let  the  input  data 
vectors  to  the  concurrent  block  processor  at  time  j  be  defined  by  Eq.  (1 1.1).  Let  the  AT-length  output 
sequence  of  this  concurrent  GS  canceller  be  Z\,  Zj,  ■  ■  • ,  Zr.  Now  zk  is  exactly  equal  to  the  output  of 
the  sliding  window  GS  canceller  at  time  instant  j ;  hence,  the  equivalence  of  the  two  processing 
schemes. 


Secondly,  the  p.d.f.’s  of  a  given  transient  performance  measure  for  the  concurrent  block  proces¬ 
sor  are  identical  for  each  zk,  k  =  1,2,. ...  K.  This  results  because  all  inputs  are  identically  distri¬ 
buted  and  the  computed  canceller  weights  are  independent  of  the  ordering  (or  permutation)  of  the 
samples  in  the  K -length  input  vectors  as  long  as  the  permutation  is  the  same  for  all  N ,  K  -length  input 
vectors.  Hence,  the  p.d.f.  of  any  transient  performance  measure  is  identical  for  the  two  types  of  can¬ 
cellers  and  the  theorem  follows. 


XII.  OVERMATCHING  DEGREES  OF  FREEDOM 


If  we  examine  the  expression  for  SNR nw(K,N)  given  by  Eq.  (8.6)  for  the  nonconcurrent  GS 
canceller,  we  find  that  for  a  fixed  number  of  input  samples  K,  the  SNR nw{K,N)  decreases  monotoni- 
cally  as  the  order  of  the  GS  structure  N.  Hence,  for  a  given  input  noise  scenario,  increasing  the 
order  of  the  GS  canceller  may  lead  to  a  noisier  output  (this  phenomenon  also  occurs  for  concurrent 
processing). 

To  illustrate  this  problem,  let  us  say  that  for  a  variety  of  input  noise  scenarios  a  GS*  N  can¬ 
celler  yields  good  cancellation  performance  and  is  therefore  specified  in  the  design.  However,  sup¬ 
pose  for  a  specific  noise  scenario  that  only  an  Lth  order  GS  canceller  is  needed  for  good  performance 
where  L  <  N.  Hence,  for  this  specific  scenario  there  is  a  loss  of  cancellation  performance  by  using 
a  GS*  N  canceller  instead  of  a  GSK  L  canceller.  For  this  case,  what  occurs  is  that  at  the  (L  -  l)th 
level  of  cancellation  in  the  GSK  N  canceller,  the  input  noises  in  the  main  channel  are  essentially  can¬ 
celled.  Thereafter,  in  each  succeeding  level  the  noise  residue  only  increases.  This  phenomenon  is 
called  “overmatching  the  degrees  of  freedom  (DOF).”  For  optimality,  we  should  have  stopped  the 
cancellation  process  after  the  (L  -  l)th  level.  Note  that  the  number  of  DOFs  for  a  GS*  N  canceller 
is  N  -  1 . 


By  using  the  expression  given  for  SNR,,*,  (A" ,  AO  in  Eq.  (8.6),  we  can  quantify  this  loss  for  non¬ 
concurrent  processing.  This  loss  is  given  by 


LOSSnw 


SNR  nw{K,L) 
Smnw{K,N) 


K  -  L  +  2 
K  -  N  +  2  ' 


(12.1) 


One  method  suggested  by  Lewis  and  Kretschmer  [9]  for  overcoming  the  effects  of  overmatching 
the  DOFs  is  to  monitor  the  nonconcurrent  noise  powers  at  each  level  of  the  main  channel  in  the  GS 
structure.  The  main  channel  is  terminated  in  the  GS  structure  where  the  noise  power  is  a  minimum. 
This  point  in  the  GS  structure  varies  as  the  noise  environment  changes  or  equivalently  as  more  or 
fewer  DOFs  are  needed. 


XIII.  SUMMARY 

The  open-loop  GS  canceller  is  shown  to  be  numerically  identical  with  the  Sampled  Matrix  Inver¬ 
sion  (SMI)  algorithm  in  the  transient  state  if  infinite  numerical  accuracy  is  assumed.  Three  forms  of 


33 


GERLACH  AND  KRETSCHMER,  JR. 


the  GS  canceller  are  discussed  and  analyzed— concurrent,  nonconcurrent,  and  sliding  window  process¬ 
ing.  Previous  convergence  results  for  concurrent  and  nonconcurrent  SMI  cancellers  that  assume 
Gaussian  inputs  have  been  reproduced  by  using  the  GS  structures  as  an  analysis  tool.  In  addition, 
new  results  are  obtained  for  when  the  input  noises  are  not  Gaussian.  Furthermore,  the  sliding  win¬ 
dow  GS  canceller  is  shown  to  have  the  same  convergence  properties  as  the  concurrent  GS  canceller. 
The  deleterious  effect  of  “overmatching  the  degrees  of  freedom”  is  discussed. 

XIV.  REFERENCES 

1.  B.L.  Lewis  and  F.F.  Kretschmer,  Jr.,  NRL,  published  works  of  limited  distribution,  dating  back 
to  Feb.  1974. 

2.  R.A.  Monzingo  and  T.W.  Miller,  Introduction  to  Adaptive  Arrays  (John  Wiley  and  Sons.  New 
York,  1980),  Ch.  8. 

3.  W.F.  Gabriel,  “Building  Block  for  an  Orthonormal-Lattice-Filter  Adaptive  Network,”  NRL 
Report  8409,  July  1980. 

4.  M.A.  Alam,  “Orthonormal  Lattice  Filter— A  Multistage,  Multichannel  Estimation  Technique,” 
Geophys.  43,  1368-1383  (1978). 

5.  B.  Friedlander,  “Lattice  Filters  for  Adaptive  Processing,”  Proc.  IEEE  70(8),  829-867  (1982). 

6.  Karl  Gerlach,  “Fast  Orthogonalization  Networks,”  IEEE  Trans.  AP-34(3),  458-462  (1986). 

7.  I.S.  Reed,  J.D.  Mallett,  and  L.E.  Brennan,  “Rapid  Convergence  Rate  in  Adaptive  Arrays,” 
IEEE  Trans.  AES-10,  853-863  (1974). 

8.  L.E.  Brennan  and  I.S.  Reed,  “Digital  Adaptive  Arrays  with  Weights  Computed  from  and 
Applied  to  the  Same  Sample  Set,”  Proceedings  of  the  19*0  Adaptive  An^nna  Symposium. 
RADC-TR-80-378,  Vol.  1,  Dec.  1980. 

9.  B.L.  Lewis  and  F.F.  Kretschmer,  Jr.,  NRL,  private  correspondence. 


34 


Appendix  A 

PROBABILITY  DENSITY  FUNCTION  ASSOCIATED 
WITH  A  TWO-INPUT  GS  CANCELLER 


In  this  appendix,  we  outline  a  derivation  for  obtaining  the  p.d.f.  of  the  transient  noise  power 
residue  and  the  transient  SNR  of  a  two-input  GS.  For  this  analysis  the  estimated  weight  is  applied  to 
a  data  set  that  is  independent  of  the  data  set  that  calculated  the  weight  (nonconcurrent  processing). 

Let  x()  and  X|  be  the  AT -length  input  vectors  associated  with  the  main  and  auxiliary  channels 
respectively.  The  estimated  weight  of  the  GS  canceller  is  then  given  by 


*1*0 
*1*1  ’ 


(Al) 


For  nonconcurrent  processing  the  transient  noise  power  residue  is  given  by 


a2  =  i 


+  w 


We  derive  the  p.d.f.  of  |  w  j 2,  from  which  the  p.d.f.  of  a2  is  easily  attainable. 
Now  let  z  =  |  w  |  2  or 

I  *(*o  l2  ,  *1*1  1 

=  —  -  V*  - - V-  - 


z  = 


\2 


(*l*l) 


=  *6  —  *o  — 

*{*i  *i*i 


Let 


,  *i*i 

v  *  *o  -T“  *o- 
*1*1 


We  derive  p(v  |  Xi).  The  matrix  xix{/x{xi  can  be  written  as 


*i*i 

*{*i 


=  <!>'  A  $  , 


(A2) 


(A3) 


(A4) 


(A5) 


where  is  a  K  x  K  unitary  matrix  and  A  is  the  diagonal  matrix  of  eigenvalues  where  the  first  diag¬ 
onal  element  equals  1  and  all  others  equal  0.  Thus 

v  =  Xq$'A4xq  =  (4»Xo)'  A  <t>  Xq.  (A6) 
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Set  a  new  AT -length  vector  y  =  <t>Xo.  It  is  easy  to  show  that  if  Xq  is  a  normalized  A*- length  multivari¬ 
ate  complex  circular  Gaussian  vector,  then  y  is  the  same.  If  y  t  is  the  first  element  of  y,  then  from 
Eq.  (A6) 

v  =  |y,|2.  (A7) 

The  p.d.f.  of  |yt  | 2  is  the  well-known  second  order  chi-square  and  thus  it  can  be  shown  that 

p(v  |  =  e  ~v  ,  v  >  0.  (A8) 

Note  that  this  p.d.f.  is  independent  of  X\.  As  a  result, 

p(v)  =  e~v ,  v>0.  (A9) 

From  Eq.  (A3)  and  elementary  probability  theory  we  can  show  that 

p(z  |jc ,)  =  x{x,e  "x'Xi ,  z  >  0 


(A10) 

(All) 

(A12) 

(A13) 

(A14) 

(A15) 
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Appendix  B 

FIRST  MOMENT  BOUND 


If  z  is  a  random  variable  with  z  >  0,  0  <  £(z|  <  oo  and  E\\/z\  <  oo,  then 


We  use  the  Cauchy-Schwarz  inequality  to  show  this.  Let  p(z)  be  the  p.d.f.  of  z.  Now 

Jo  p{z)  dz  =  yj ■  >fzp(z)  dz  =  1 , 


(Bl) 


(B2) 


where  the  square  root  function  shown  above  is  the  positive  square  root  function.  Using  the  Cauchy- 
Schwarz  inequality, 


*  '  f0"?P(z)<fc  a  \~y[2&->IzpV)<k. 

Thus 

L  dz  •  J  zp(z)  dz  ^  1. 

J0  £  •'0 

Equation  (Bl)  follows  from  Eq.  (B4). 


(B3) 


(B4) 


